Round 1 - Technical (Data Pipeline Design)
🔹 Discussion of my past experience and projects, plus some resume-based questions
🔹 Conceptual design of a real-time streaming data pipeline
🔹 Lengthy discussion of Apache Kafka concepts such as offset management, sync and async commits, partition assignment, consumer groups, and back pressure in Kafka
🔹 How to use Docker to scale real-time data streaming applications?
🔹 Deployment process with CI/CD
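To make the back pressure topic from this round concrete, here is a minimal sketch in plain Python. It does not use any Kafka client API: a bounded in-memory queue stands in for the consumer's buffer, and the names run_pipeline, n_messages and buffer_size are hypothetical, chosen only for illustration. The idea it shows is that a fast producer is forced to wait whenever a slow consumer has not drained the buffer.

```python
import queue
import threading
import time


def run_pipeline(n_messages: int, buffer_size: int) -> tuple[list[int], int]:
    """Push n_messages through a bounded buffer; return (processed, max_depth)."""
    buffer: queue.Queue = queue.Queue(maxsize=buffer_size)
    processed: list[int] = []
    max_depth = 0
    sentinel = object()  # marks the end of the stream

    def producer() -> None:
        nonlocal max_depth
        for offset in range(n_messages):
            # put() blocks while the buffer is full: this is the back pressure.
            buffer.put(offset)
            max_depth = max(max_depth, buffer.qsize())
        buffer.put(sentinel)

    def consumer() -> None:
        while True:
            item = buffer.get()
            if item is sentinel:
                break
            time.sleep(0.001)  # simulate slow downstream processing
            processed.append(item)

    threads = [threading.Thread(target=producer), threading.Thread(target=consumer)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return processed, max_depth


if __name__ == "__main__":
    done, depth = run_pipeline(n_messages=50, buffer_size=5)
    print(len(done), "messages processed; buffer never exceeded", depth)
```

The buffer depth can never exceed buffer_size, so memory stays bounded no matter how fast the producer is. In a real Kafka consumer the equivalent control is usually applied by limiting how much is polled or by pausing consumption, which this sketch does not attempt to model.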
Round 2 - Coding
🔹 Write code to calculate the power of a given number with minimum time complexity. (Hint: I used recursion with a dynamic programming concept to solve it in O(log n) time.)
🔹 Given an expression in infix, postfix, or prefix notation, write code to evaluate it and return the final result. (Hint: this is a standard stack data structure question.)
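The two coding questions above can be sketched roughly as follows. This is one possible approach, not necessarily what was written in the interview. The power function uses recursion that computes the half power once and reuses it (exponentiation by squaring), which gives O(log n) multiplications. The evaluators assume space-separated tokens and only the four basic operators, and the function names are hypothetical. Infix input is not handled here; it is commonly converted to postfix first (for example with the shunting-yard algorithm).

```python
import operator

OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def power(base: float, exponent: int) -> float:
    """Compute base ** exponent in O(log |exponent|) multiplications."""
    if exponent < 0:
        return 1 / power(base, -exponent)
    if exponent == 0:
        return 1
    half = power(base, exponent // 2)  # computed once, reused twice
    result = half * half
    return result * base if exponent % 2 else result


def eval_postfix(expression: str) -> float:
    """Evaluate a space-separated postfix expression using a stack."""
    stack: list[float] = []
    for token in expression.split():
        if token in OPS:
            right = stack.pop()  # the right operand is on top
            left = stack.pop()
            stack.append(OPS[token](left, right))
        else:
            stack.append(float(token))
    if len(stack) != 1:
        raise ValueError("malformed expression")
    return stack[0]


def eval_prefix(expression: str) -> float:
    """Evaluate a space-separated prefix expression by scanning right to left."""
    stack: list[float] = []
    for token in reversed(expression.split()):
        if token in OPS:
            left = stack.pop()  # scanning backwards, the left operand is on top
            right = stack.pop()
            stack.append(OPS[token](left, right))
        else:
            stack.append(float(token))
    if len(stack) != 1:
        raise ValueError("malformed expression")
    return stack[0]


if __name__ == "__main__":
    print(power(2, 10))                 # 1024
    print(eval_postfix("2 3 4 * +"))    # 14.0
    print(eval_prefix("+ 2 * 3 4"))     # 14.0
```

Note that the operand order matters for subtraction and division: in postfix the first value popped is the right operand, while in the reversed prefix scan it is the left operand.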
Round 3 - Technical
🔹 Apache Spark questions: fundamentals, OOM scenarios in Spark and how to fix them, how to optimize a Spark application, optimized joins in Spark, data skew, and the key salting technique
🔹 Apache Kafka fundamentals
🔹 SQL queries that combine joins and GROUP BY
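As a rough illustration of the key salting technique mentioned in this round, the sketch below uses plain Python lists rather than Spark, so none of it is Spark API. All names (salt_large_side, explode_small_side, hash_join, NUM_SALTS) and the sample data are hypothetical. The idea: append a salt to the join key on the skewed side so that one hot key becomes several smaller keys, and replicate the small side once per salt so every salted key still finds its match. The salt is assigned round-robin here to keep the output deterministic; in practice it is often chosen at random.

```python
from collections import Counter, defaultdict

NUM_SALTS = 4  # hypothetical; in practice tuned to the degree of skew


def salt_large_side(rows, num_salts):
    """(key, value) -> ((key, salt), value), spreading each key over num_salts sub-keys."""
    return [((key, i % num_salts), value) for i, (key, value) in enumerate(rows)]


def explode_small_side(rows, num_salts):
    """Replicate each small-side row once per salt so every salted key has a match."""
    return [((key, salt), value) for key, value in rows for salt in range(num_salts)]


def hash_join(left, right):
    """Inner join on the first tuple element, building a hash index on the right side."""
    index = defaultdict(list)
    for key, value in right:
        index[key].append(value)
    return [(key, lv, rv) for key, lv in left for rv in index.get(key, [])]


def largest_key_group(rows):
    """Size of the biggest group of rows sharing one join key."""
    return max(Counter(key for key, _ in rows).values())


# Skewed sample data: one hot key dominates the large side.
events = [("hot", i) for i in range(1000)] + [("cold", i) for i in range(10)]
dims = [("hot", "H"), ("cold", "C")]

salted_events = salt_large_side(events, NUM_SALTS)
salted_dims = explode_small_side(dims, NUM_SALTS)

plain = hash_join(events, dims)
salted = hash_join(salted_events, salted_dims)
# Drop the salt again so both results are comparable.
unsalted = [(key, lv, rv) for (key, _salt), lv, rv in salted]

if __name__ == "__main__":
    print(largest_key_group(events))         # 1000
    print(largest_key_group(salted_events))  # 250
    print(sorted(unsalted) == sorted(plain))  # True
```

The join result is unchanged, but the largest group of rows sharing a key drops from 1000 to 250, which is what lets a distributed engine spread the hot key over several tasks. The cost is that the small side grows by a factor of NUM_SALTS.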
Round 4 - Hiring Manager
🔹 Discussion of my past experience and projects, plus some resume-based questions
🔹 He wanted to know about my good and bad experiences with past employers
🔹 How will you work in a team under tight project delivery timelines?
🔹 Why did you leave McKinsey & Company in just 4 months?
🔹 What are you expecting in your next job role?